Collaborating Authors

probabilistic value

One Sample Fits All: Approximating All Probabilistic Values Simultaneously and Efficiently

Li, Weida, Yu, Yaoliang

arXiv.org Artificial Intelligence

The concept of probabilistic values, such as Beta Shapley values and weighted Banzhaf values, has gained recent attention in applications like feature attribution and data valuation. However, exact computation of these values is often exponentially expensive, necessitating approximation techniques. Prior research has shown that the choice of probabilistic values significantly impacts downstream performance, with no universally superior option. Consequently, one may have to approximate multiple candidates and select the best-performing one. Although there have been many efforts to develop efficient estimators, none are intended to approximate all probabilistic values both simultaneously and efficiently. In this work, we embark on the first exploration of achieving this goal. Adhering to the principle of maximum sample reuse, we propose a one-sample-fits-all framework parameterized by a sampling vector to approximate intermediate terms that can be converted to any probabilistic value without amplifying scalars. Leveraging the concept of $ (\epsilon, \delta) $-approximation, we theoretically identify a key formula that effectively determines the convergence rate of our framework. By optimizing the sampling vector using this formula, we obtain i) a one-for-all estimator that achieves the currently best time complexity for all probabilistic values on average, and ii) a faster generic estimator with the sampling vector optimally tuned for each probabilistic value. Particularly, our one-for-all estimator achieves the fastest convergence rate on Beta Shapley values, including the well-known Shapley value, both theoretically and empirically. Finally, we establish a connection between probabilistic values and the least square regression used in (regularized) datamodels, showing that our one-for-all estimator can solve a family of datamodels simultaneously.
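The Shapley value discussed above is the best-known probabilistic value, and its exact computation requires evaluating the game on exponentially many coalitions. The following minimal Monte Carlo sketch (a generic permutation-sampling estimator, not the paper's one-sample-fits-all framework, applied to a made-up additive toy game) illustrates the kind of approximation such estimators perform:

```python
import random

def monte_carlo_shapley(game, n, num_samples=2000, seed=0):
    """Estimate the Shapley values of an n-player cooperative game by
    sampling random permutations and averaging each player's marginal
    contribution to the coalition of players preceding it."""
    rng = random.Random(seed)
    phi = [0.0] * n
    for _ in range(num_samples):
        perm = list(range(n))
        rng.shuffle(perm)
        coalition = set()
        prev = game(coalition)
        for i in perm:
            coalition.add(i)
            cur = game(coalition)
            phi[i] += cur - prev  # marginal contribution of player i
            prev = cur
    return [p / num_samples for p in phi]

# Toy game: a coalition's value is the sum of fixed player weights.
# The game is additive, so each player's Shapley value is its own weight.
weights = [1.0, 2.0, 3.0]
est = monte_carlo_shapley(lambda S: sum(weights[i] for i in S), n=3)
```

For this additive toy game every marginal contribution of player i equals its weight, so the estimate matches the exact Shapley values; for general games the estimate only converges as the number of sampled permutations grows, which is the cost the paper's estimators aim to reduce.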


The Application of Affective Measures in Text-based Emotion Aware Recommender Systems

Leung, John Kalung, Griva, Igor, Kennedy, William G., Kinser, Jason M., Park, Sohyun, Lee, Seo Young

arXiv.org Artificial Intelligence

This paper presents an innovative approach to two problems researchers face in Emotion Aware Recommender Systems (EARS): the difficulty of collecting voluminous, good-quality emotion-tagged datasets, and the lack of an effective way to protect users' emotional data privacy. Without enough good-quality emotion-tagged datasets, researchers cannot conduct repeatable affective computing research in EARS that generates personalized recommendations based on users' emotional preferences. Similarly, if we fail to fully protect users' emotional data privacy, users may resist engaging with EARS services. This paper introduces a method that detects affective features in subjective passages using Generative Pre-trained Transformer technology, forming the basis of the Affective Index and Affective Index Indicator (AII), and eliminating the need for users to build their own affective feature detection mechanism. The paper advocates a separation-of-responsibility approach in which users protect their emotional profile data while EARS service providers refrain from retaining or storing it. Service providers can update users' Affective Indices in memory without saving their private data, providing affective-aware recommendations without compromising user privacy. This paper offers a solution to the subjectivity and variability of emotions, data privacy concerns, and evaluation metrics and benchmarks, paving the way for future EARS research.


Logistic Regression -- An Overview with an Example

#artificialintelligence

Known for its simplicity, the Logistic Regression algorithm is reliable and extremely useful, which is why, when it comes to binary classification problems, Logistic Regression is many engineers' go-to choice. Logistic Regression uses the sigmoid function to output continuous probabilistic values between 0 and 1 for any value of its independent variables, and these probabilistic values are then compared against a threshold of 0.5. Any value greater than 0.5 is classified into the "1" class, and any value less than 0.5 is classified into the "0" class, i.e., the class in which the particular event does not take place. A common question people ask is: if Logistic Regression is used for classification problems, why does it have the term "Regression" in its name? And why can't we use Linear Regression instead of Logistic Regression for classification problems? The answer to the first question is that even though Logistic Regression is used for binary classification, the output of the sigmoid equation is still a continuous numerical value.
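The sigmoid-plus-threshold mechanism described above can be sketched in a few lines. This is a minimal illustration with made-up weights and inputs, not a trained model:

```python
import math

def sigmoid(z):
    # Maps any real number to a probability in the open interval (0, 1)
    return 1.0 / (1.0 + math.exp(-z))

def predict(x, weights, bias, threshold=0.5):
    # Linear combination of the independent variables, squashed by the
    # sigmoid into a probability, then compared against the threshold
    z = sum(w * xi for w, xi in zip(weights, x)) + bias
    p = sigmoid(z)
    return (1 if p >= threshold else 0), p

# Hypothetical example: two features with arbitrary illustrative weights
label, prob = predict([2.0, -1.0], weights=[0.8, 0.3], bias=-0.5)
```

Here `z = 0.8` and the sigmoid yields a probability of roughly 0.69, so the example is assigned to the "1" class; the continuous probability `prob` is exactly the "regression" output the last paragraph refers to.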